K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 46 | 86 | 89 | 96 | 99 |
1000 | 152 | 408 | 661 | 832 | 928 |
10000 | 657 | 2059 | 3999 | 6011 | 7598 |
100000 | 2480 | 14426 | 34533 | 54549 | 69652 |
1000000 | 7642 | 55255 | 184567 | 377788 | 554670 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings